Inhibition/Enhancement Network Based ASR using Multiple DPF Extractors

Authors

Foyzul Hassan

Mohammed Rokibul Alam Kotwal

Mohammad Mahedi Hasan

Muhammad Ghulam

Mohammad Nurul Huda

Abstract

This paper evaluates an Inhibition/Enhancement (In/En) network for robust automatic speech recognition (ASR). In neural-network-based speech recognition that uses distinctive phonetic features (DPFs), the In/En network is needed to discriminate whether the dynamic patterns of the DPF trajectories are convex or concave. The network achieves categorical DPF movement by enhancing DPF peak patterns (convex patterns) and inhibiting DPF dip patterns (concave patterns). We analyze the effectiveness of the In/En algorithm by incorporating it into a system that consists of three stages: a) multilayer neural networks (MLNs), b) the In/En network, and c) Gram-Schmidt (GS) orthogonalization. Experiments on the Japanese Newspaper Article Sentences (JNAS) database in clean and noisy acoustic environments show that the In/En network plays a significant role in improving phoneme recognition performance. Moreover, the In/En network reduces the number of mixture components required in the hidden Markov models (HMMs).

Index Terms: Articulatory Features, Hidden Markov Model, Inhibition/Enhancement Network, Local Features, Multilayer Neural Network, Distinctive Phonetic Features.
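The last two stages named in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the abstract gives no formulas, so the curvature test (a discrete second difference), the scaling factors `enhance` and `inhibit`, and all function names are assumptions made for illustration. The Gram-Schmidt routine is the generic textbook procedure applied to a list of vectors; how the paper applies it to DPF vectors is not stated in the abstract.

```python
from math import sqrt


def second_difference(traj):
    """Discrete second difference of a trajectory; endpoints are left at 0."""
    dd = [0.0] * len(traj)
    for t in range(1, len(traj) - 1):
        dd[t] = traj[t - 1] - 2.0 * traj[t] + traj[t + 1]
    return dd


def inhibit_enhance(traj, enhance=2.0, inhibit=0.5):
    """Scale each frame of one DPF trajectory by its local shape.

    Negative curvature (a peak, "convex" in the abstract's wording) is
    enhanced; positive curvature (a dip, "concave") is inhibited.
    The factors are hypothetical, not values from the paper.
    """
    out = []
    for value, curvature in zip(traj, second_difference(traj)):
        if curvature < 0:
            out.append(value * enhance)
        elif curvature > 0:
            out.append(value * inhibit)
        else:
            out.append(value)
    return out


def gram_schmidt(vectors, eps=1e-12):
    """Return an orthonormal basis spanning the given vectors."""
    basis = []
    for v in vectors:
        w = [float(x) for x in v]
        # Remove the component of w along every basis vector found so far.
        for b in basis:
            proj = sum(wi * bi for wi, bi in zip(w, b))
            w = [wi - proj * bi for wi, bi in zip(w, b)]
        norm = sqrt(sum(wi * wi for wi in w))
        if norm > eps:  # skip vectors that are linearly dependent
            basis.append([wi / norm for wi in w])
    return basis


if __name__ == "__main__":
    print(inhibit_enhance([0.1, 0.9, 0.1]))  # the middle frame is a peak
    print(inhibit_enhance([0.9, 0.1, 0.9]))  # the middle frame is a dip
    print(gram_schmidt([[1, 1], [1, 0]]))
```

In this sketch the peak frame is scaled up and the dip frame is scaled down, which pushes a DPF trajectory toward the categorical movement the abstract describes. The orthogonalization step returns mutually orthogonal unit vectors.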


Similar resources

Canonicalization of feature parameters for automatic speech recognition

Acoustic models (AMs) of an HMM-based classifier include various types of hidden variables such as gender type, speaking rate, and acoustic environment. If there exists a canonicalization process that reduces the influence of the hidden variables from the AMs, a robust automatic speech recognition (ASR) system can be realized. In this paper, we describe the configuration of a canonicalization p...


Canonicalization of Feature Parameters for Robust Speech Recognition Based on Distinctive Phonetic Feature (DPF) Vectors

This paper describes a robust automatic speech recognition (ASR) system with less computation. Acoustic models of a hidden Markov model (HMM)-based classifier include various types of hidden factors such as speaker-specific characteristics, coarticulation, and an acoustic environment, etc. If there exists a canonicalization process that can recover the degraded margin of acoustic likelihoods be...


Designing multiple distinctive phonetic feature extractors for canonicalization by using clustering technique

Acoustic models of an HMM-based classifier include various types of hidden factors such as speaker-specific characteristics and acoustic environments. If there exists a canonicalization process that suppresses the loss of acoustic-likelihood differences among categories caused by hidden factors, a robust ASR system can be realized. We have previously proposed the canonicalization proce...


Phoneme recognition based on hybrid neural networks with inhibition/enhancement of distinctive phonetic feature (DPF) trajectories

In this paper, we introduce a novel distinctive phonetic feature (DPF) extraction method that incorporates inhibition/enhancement functionalities by discriminating whether the dynamic patterns of the DPF trajectories are relevant or not. The trajectory of each DPF shows a convex pattern when the DPF is relevant and a concave one when it is irrelevant. The proposed algorithm enhances convex type patterns and inhibit...


Distinctive Phonetic Feature (DPF) Based Phone Segmentation Using 2-stage Multilayer Neural Networks

Segmentation of speech into its corresponding phones has become a very important issue in many speech processing areas such as speech recognition, speech analysis, speech synthesis, and speech databases. In this paper, for accurate segmentation in speech recognition applications, we introduce Distinctive Phonetic Feature (DPF) based feature extraction using a two-stage MLN (Multi-Layer Neural Netw...



Journal:

Journal of Multimedia

Volume 6, Issue

Pages -

Publication date: 2011
